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METHOD OF DYNAMICALLY COMPENSATING FOR VARIABLE 
TRANSMISSION DELAYS IN PACKET NETWORKS 

This invention relates to a method and apparatus for 
dynamically compensating for variable transmission delays 
5 in packet networks, particularly voice networks, but the 
invention is also applicable to other networks, such as 
video networks. The invention is applicable in all 
integrated packet networks where voice or video may be 
carried including, for example, frame relay networks, ATM 
10 networks, PCME (packet circuit multiplication equipment) , 
and LANs . 

In this specification, reference is made throughout 
to "voice" packets, since this is the term normally used 
in the art to describe such networks, although it will be 
15 realized by one skilled in the art that such networks 

extend to any network capable of transmitting any form of 
audio whether it actually be voice or other form of 
reproducible sound. 

Two methods have been proposed to compensate for 
20 variable transmission delays in packet voice networks. 

In the first method, known as timestamping, which is 
used in ITU standard G.764, the accumulated variable 
transmission delay experienced by a voice packet is 
recorded in a timestamp field in the packet. Each 

25 intermediate node recognizes voice packets and adds to 
the timestamp field the amount of time that it took for 
the packet to transit the node. The receiver uses the 
value in the timestamp field to determine when to play- 
out the voice packet . Voice packets that experience 

3 0 little delay in the network will be delayed in the 
receiver longer before being played-out, and voice 
packets that experienced long network delays will be 
delayed less in the receiver. The effect is that the sum 
of the network delay and the delay in the receiver will 



BNSDOCID: cWO 9S22233A1 J_> 



10 



15 



20 



25 



30 



35 



WO 95/22233 

PCT/CA95/00062 



- 2 - 

be nearly constant for all voice packets and rh. 
W.11 be Played-out at a uniform rLe . 

The disadvantage of this method is ^h.^ • 
nodes must recognize voice packets and -^ermediate 
processina • , Packets and carry out special 

P ocessxng. This makes this method incompatible vrLh 
exxst. etworks that do not support this Ln tion " 
Anot,,, disadvantage is that prior knowledge of the 
maxxmum expected delay variation is necessary. 

method" a\' "r"^ ^^-'^ ^elay 

method, a fixed delay is always added at th. • 

first packet of a talk sequence tL deli:^^^^^^ 

~ — - -ay expected from 

minimum delay the ::3;e: '"^^ ^^^^^^ experiences 

delay to m.l compensates by adding enough 

delay a.:::: ^'^^^ experience more 

ay, arrxve before their scheduled play-out time. 

The disadvantage of this method is that i^ m=. 
increase the delav ir, ""^^ 
value This is bee '"^f ^^^^ ^^^^^ optimal 

• because xf first packet has already 

experienced the wor=!r n^^.^ ^ =^J-teaay 

include th. ^''^ — ^^ded will 

include the worst case twice r =^r,^ ^ -, 

ITU T Recommendation G.114) . Delay is of special 
concern when speech and facsimile demodulated traffic 

::t"^ ^^"^ -a„.mission facility, ^i:::"-- 

networ. subject to variable transmLMon LlaT ^ '^'^'^^ 
compr i ^^^^^^^^^ ^^^^^^^^ ^ 

buffer,, applying a delay to the first packet of a 
sequence of packets,- applying a variable delay to 
subsequent packets of the sequence to produce an 
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appropriate constant play-out rate to reproduce the 
desired output; monitoring utilization of said buffer,- 
and adjusting the delay applied to the first packet of a 
following sequence of packets to maintain said 
utilization within desired limits to minimize the risk of 
said buffer underf lowing or overflowing. 

The utilization monitored can be, for example, the 
buffer -fill level or the dwell time of packets in the 
buffer. Alternatively, the arrival rate of the packets 
could be monitored. 

The packets may, for example, be voice packets or 
video packets . 

The invention (Adaptive Delay Equalization) thus 
uses an adaptive method to determine when to play-out 
received packets. This method minimizes the delay 
applied to received packets. In a preferred embodiment, 
the receiver starts by applying a pre -determined delay to 
the first packet of a talk-spurt. The receiver delays 
subsequent packets by an amount appropriate to produce a 
constant gap-free play-out rate. If the minimum number of 
packets in the buffer is large (i.e., the buffer is never 
close of under- flowing) , the system slowly reduces the 
predetermined delay, known as the build -out delay. if 
packets arrive late, the build- out delay is increased in 
order to minimize packet loss. The build-out delay 
adjustment can be done during speech silence. The 
duration of gaps between spoken words is precisely 
replicated by sending a "silence duration' value in the 
first packet of each new talk spurt . 

This method has the advantage that it minimizes the 
delay when the network is not congested, adapts itself to 
operate optimally in various network conditions, without 
any user reconfiguration, and it does not require any 
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complicated and specialized node h^nrn • 
packets (e.g. time- snapping, 

The invention also provides an ;,r,r, 
out packets receive, ehrough a pacLrLT": "'^^'"^ 
S va..a..e t.a„.....,cn .e.Js, co;.' .:ra"t,,t'r^ " 
receiving incoming packets- a cr,=l k ""^ 
receiving said paLetfr reconstituter for 

a saia packets from said buffer and 

reconstituting speech samples therefrom T 
unit for a applying a delay co thr ■ "^"^^^ ^^^-V 

0 control unit for cLtrollL T "'""^^"^ ^ 

^elay .o su.se-^ent'pack'Tts of the' se^^^cT ^ ^"'^'^ 
produce an appropriate constant ^1^;!:^: ::,:°,^= " 
buffer,, means for monitoring the utLr . 

' pa-\-\-f^rr-- 

^.htain said ^d^^::^-:^-^- " 

°^ -erfio„::g"or 

The invention will now be described 
by way Of e.a^le only, „i., referince o rhr'^^ '^^""^ 
accompanying drawings, in which :- 

transm's'rrn^e'L; ZlZ^' "'''"'^ ^ ^^^-^^ 

aexay packet voice network ; 

Figure 2 is a timing diagram of a packeti^.H 
transmission system in accordance with IT 

® wxth the invention; and 

Figure 3 is a block diagram of a variable d.i 
compensation apparatus m accordance .ith^t'il'Lon . 
As shown in Fioure i c.-^^ i. ■ 
Packeti.ed form though a^ ^ IZ^J^r^'-'' 

ay during transmission through the network. 
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The incoming packets are received by receiver 3, which 
outputs a re-assembled speech signal. 

The packet network 2 introduces a variable 
propagation delay A. Receiver 3 introduces a further 
delay 5 in the manner to be described. 

Referring now to Figure 2, input speech consists of 
spurts 4 separated by periods of silence 5 . Each spurt 4 
is represented by a sequence of packets 6, which when 
they are transmitted are separated by fixed spaces 7 as 
shown at line 8. However, after transmission through the 
network the packets are no longer equally spaced, as 
shown at line 9, due to the variable propagation delays 
in the network. In accordance with the invention, as 
shown at line 10, the first packet 6a of each sequence is 
subjected to a predetermined delay, which is estimated to 
be adequate to avoid buffer underflow and overflow. The 
remaining packets of the sequence are subjected to 
variable delays to maintain the appropriate constant 
output 11. This is then decoded to reproduce the initial 
speech as shown at line 12. 

Referring now to Figure 3, the incoming speech 
packets 6 are fed to variaJble delay unit 20, which 
introduces a variable delay between the packets . The 
output of variable delay unit 20 is fed to speech play- 
out buffer 21, which outputs the speech packets to speech 
reconstituter 22, which turns the speech packets into 
constant rate speech samples, normally at 8KHz . These 
speech samples are then converted into analog speech 
signals in digital-to-analog converter 23. 

Incoming speech packets 6 are also fed to packet 
analyzer 24 whose function is to identify the start of a 
speech spurt and trigger control unit 25, which sets the 
delay introduced by the variable delay unit 2 0 so as to 
produce a constant output rate. Buffer fill level 
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monitor 26 monitors the fill ^ 

buffer 71 r. ^. °^ speech play-out 

buffer 21. Dependxng on the fin level of buffer 2i 
control unit 2^ v;:,!-;^^ ^-u ■ . ^urrer 21, 

the next tal. spirt . ^"'^^ ^^-^ °^ 

Similar ult ^^^'^^-^ - 

similar unit monitoring the dwell time of th. ^ I 

the buffer Alr^^r, = ^ • n ^ packets in 

ui:rer. Alternatively, the buffer utin^=^- 

i-u cne rirst packet of a = 
detected by packet analyzer 24 Th» "I'^-^P'^" 

delays subsequent packets bv " ^''^^ 

yueiit packets by an amount approDriate rr, 
produce a constant play-out rato 

buffer 21 If ^ play-out 

- fio„in,", , t.e ;on:ror::r: 2TsT "t^^'- ^"-^^ ^° 

ou. delay. xf packets^ llT.TTi:"' 

risks under-flowing,, t.e buLd out delay' is I ^"^^ 
orde-r hr^ m-;^- • aeiay is increased in 

<->raer to minimize packet lo<?<5 a^-; ^ 

out ri^i=..r ^ Adjustment of the build- 

out delay can be determined in a number of ways su.h 
monitn-r-inrT f-i,^ • • '^^■^ wi. ways, such as 

monitoring the minimum, maximum or averaae urm^^.- 
the buffer Alr<=.r^ = .-- , average utilization of 

time a packetl^InT .Tt^e b^f ft^"^^^^^^ " ^ ^ 

spok is precslly-rrptraTeTbT T^Z.. 

uuration value in ^ho -f-;-.-^*- , *=-Lxeiice 
sm,^^ K- packet of each new talk 

spurt, which can be detected h^r .-h^ 

When the build-out delay has H ^ ^"^^^^-^ ■ 

duration is artificial v — sed, the silence 

IS artificially increased. The result is a 
larger buil.-out delay .uring the next talk spurt When 
the build-out .aelay has to be decreased, the silence 
duration is artificially decreased. The result is a 
Shorter build-out delay during the next talk spuL 



20 



25 



35 



Although the preferred embodiment has been described 
with reference to audio sicmal ^Ho • • ^^^cribed 

signal, the invention may also be 
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applied to video transmission. In this case video packets 
carry the video data, and the speech reconstituter is 
replaced by a video reconstituter, which operates in an 
analogous manner. Indeed the invention is applicable to 
5 any digitized physical signal that is transmitted in 

packetized format and then reconstituted at the far end. 
The video implementation looks the same as the 
implementation shown in the drawings with the word 
"video" si±>stituted for the word "speech" throughout . 
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b ZT ^"""-3 Packsts in a buffer- 

b) applying a delay to the fir=f , 
sequence of packets,- °^ a 

out rate .aL pacLtr o" '^"'''''"''^'^ ant play- 

^) -nitorinnL"tmLf '^^^"^ ""P" 
e) aijustina the d!, ° °' and 

o. a following sl^^.TlT ^T^l^l^^^ — 

buffer utilization „ithi„ 

Of .ai. .uffer underflowin/oi tirf L~^^ ^''^ 

2 • A method as claimed in claim i 

that in step d the fin level L ' ! /^""^^^^^--^ in 
monitored. °^ ^^^^ ^^^^^^ is 

3 • A method as claimed in claim i 

that in step d the dwell time orsaid r'^^'^^' 
i'uffer is monitored. Packets in said 

4 • A method as claimed in claim i 

that said packets sr-^ ■ ' =^^^^<=terized in 

packets are voxce or audio packets. 

5. A method as claimed in any of cla^n,. . 
characterized in that v, ^ to 4, 

Signal spurt. '"^^ ^^^^"^^ represents a 



30 7 



:.at :ar:ra;^:j::::l- ^' -aracteri.. in 

ao^ivity in the .a^. r^rei^r^rr:;---- 

' - A method as claimed in claim 
that a Signal inactivity durat^o! ' ' 
the first packet of each spurt " " 



BIMSOOCID: <WO 9S22233A1 I > 




PCT/CA95/00062 



WO 95/22233 



- 9 - 



8 . 



A method as claimed in claim 1, characterized in 
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that said packets are video packets . 

9 . An apparatus for playing out packets received 
through a network subject to variable transmission 
delays, comprising: 

a) a buffer for receiving incoming packets; 

b) a signal reconstituter for receiving said packets 
from said buffer and reconstituting sicfnal samples 
therefrom; 

c) a variable delay unit for applying a delay to the 
incoming packets; 

d) a control unit for controlling said delay to 
apply a first delay to an incoming sequence of packets 
and a variable delay to subsequent packets of the 
sequence so as to produce an appropriate constant play- 
out rate to said buffer; 

e) means for monitoring the utilization of said 
buffer; and 

f) means for adjusting said first delay applied to 
the packets of a following sequence of packets to 
maintain said buffer utilization within desired limits to 
minimize the risk of said buffer underflowing or 
overflowing. 

10. An apparatus as claimed in claim 9, characterized in 
that said means for monitoring the utilization of said 
buffer comprises a buffer fill level monitor. 

11 . An apparatus as claimed in claim 9 , characterized in 
that said means for monitoring the utilization of said 
buffer comprises a monitor determining the amount of time 
the packets spend in the buffer. 

12. An apparatus as claimed in claim 9, characterized in 
that it further comprises a packet analyzer for detecting 
the start of a signal spurt . 
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13 . An apparatus as claimed in claim 7 h 

that said delay adjustment is carrxer '^""^^^^^"^^^ - 

between signals to .e transmitted '""'"^ ^^^^ 

14 - An apparatus as claimed in claim 9 h 
that said packets are voice nr- t ' ^^^^^^^^^^^ed in 

voxce or audio packets. 

15. An apparatus as claimed in clai™ p . 

that said packets are video packets "^^"^^^--i-ed in 
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